Poincare recurrences of DNA sequence 
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We analyze the statistical properties of Poincare recurrences of Homo sapiens, mammalian and 
other DNA sequences taken from Ensembl Genome data base with up to fifteen billions base pairs. 
We show that the probability of Poincare recurrences decays in an algebraic way with the Poincare 
exponent /3 ~ 4 even if oscillatory dependence is well pronounced. The correlations between recur- 
rences decay with an exponent v « 0.6 that leads to an anomalous super-diffusive walk. However, 
for Homo sapiens sequences, with the largest available statistics, the diffusion coefficient converges 
to a finite value on distances larger than million base pairs. We argue that the approach based on 
Poncare recurrences determines new proximity features between different species and shed a new 
light on their evolution history. 
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The Poincare recurrence theorem of 1890 [l[ states that 
after a certain time a dynamical Hamiltonian trajectory 
in a bounded phase space always returns to a close vicin- 
ity of an initial state. Even if recurrences definitely take 
place the question about their properties, or more exactly 
what are the statistics of Poincare recurrences, and what 
are their correlation properties, still remain an unsolved 
problem for systems of dynamical chaos even after an 
impressive development of the theory of dynamical com- 
plexity dn. The two limiting case of periodic and fully 
chaotic motion are well understood: in the first case the 
recurrences are periodic while in the latter case the prob- 
ability of recurrences Pit) with time being larger than t 
drops exponentially at t — >■ oo Thus, the latter case 

is similar to a coin flipping, where a probability to stay 
on the same side after more than t flips decays at 2~*. 
However, in generic Hamiltonian systems the probability 
P(t) decays algebraically with t, as P(t) ~ 1/f 9 , due to 
long trappings in a vicinity of stability islands showing 
the Poincare exponent (3 » 1.5 [51-floj]. A detailed the- 
oretical explication of this slow algebraic decay is still 
lacking. Usually, the consecutive recurrences in dynam- 
ical systems are not correlated since a trajectory passes 
across domains of chaotic component. 

The Poincare recurrences represent a powerful tool for 
analysis of statistical properties of symbolic trajectories 
of various types 0-0] ■ Surprisingly, this powerful tool 
of dynamical systems has not been applied for detailed 
statistical studies of DNA sequence which also can be 
viewed as a symbolic trajectory. There have been only 
a few earlier attempts going in this direction including 
researchers in dynamical systems H| and bioinformat- 
ics 
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However, in |11| only short recurrence times 
with t < 4 have been considered and it was concluded 
that the probability of recurrences decays exponentially. 
The studies in bioinformatics were not aware about the 
concept of Poincare recurrences but their approach had 
certain links with them aiming to use digital signal rep- 
resentations of genomic data 12j . The relative frequency 



analysis applied in ljl, 14 1 has certain similarities with 
the Poincare recurrences approach but the distance times 
still remain very short with t < 20 in [l3} and t < 100 
in [3]. No detailed comparative analysis with exponen- 
tial decay of Poincare recurrences of random sequences 
or algebraic decay was presented there. 

In this work, we apply the powerful approach of 
Poincare recurrences to available mammalian DNA se- 
quences taken from the publicly available database fljj . 
The comparison with random data sequences and the 
known results for dynamical maps |5l— ll 0I| allowed us to 
establish new interesting features for the Poincare re- 
currences of DNA sequence. Our approach allowed to 
analyze the recurrences with time t being by 5 to 6 or - 
ders of magnitude larger than those reached in 
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For Homo sapiens (HS) database we performed statisti- 
cal analysis for 1.5 • 10 10 base pairs (bp). This amount 
of statistical data is by 4 — 5 orders of magnitude larger 
compared to the previous studies of anomalous diffusion 
performed in [16H18| for DNA sequences. Using this large 
statistics we find that the DNA Poincare recurrences are 
characterized by an algebraic decay with /3 « 4 for HS 
database. For such a value of the Poincare exponent /3, 
the uncorrelated recurrences should lead to a usual diffu- 
sive random walk with a linear growth of the correspond- 
ing second moment a ~ Dt 0, @] , with an effective time 
t given by the sequence length L measured in number of 
bp. At the same time the early studies for random walk in 
DNA sequences [H-El, with the total length of t < 10 6 
bp, established that such a walk belongs to the Levy 
type walks, with an anomalous superdiffusive growth of 
the second moment a ~ t 1+fJ - and a growing diffusion 
coefficient D(t) — a/t ~ t^ with \i > 0. Our studies 
show that this apparent contradiction is resolved by the 
presence of long range correlations Cp(t) between the 
Poincare recurrences in DNA that makes them different 
compared to dynamical chaos systems where such corre- 
lations are usually absent [5l-[lol|. We show that Cp(t) is 
characterized by a global algebraic decay with an expo- 
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nent v w 0.6. Such a slow decay leads to an anomalous 
super-diffusion on scales of t < 10 6 bp with the exponent 
[i being in agreement with the previous studies |16l4l8l |. 
However, for t > 10 6 bp the diffusion coefficient D(t) for 
HS becomes finite due to cancellations of odd and even 
correlation terms which show a global algebraic decay 
with an exponent v k, 0.6. We argue that the obtained 
results for the statistics of Poincare recurrences of DNA 
sequence open new possibilities for the genome evolution 
analysis. 

To study the statistics of Poincare recurrence of mam- 
malian DNA sequences we use the enormous database 
15] considering a DNA sequence as a very long trajec- 
tor y in the space of four nucleobases A, G, C, T. Similar 
to [16J, a walk along the DNA sequence length, marked 
as an effective time t, is described by a discrete variable 
u(t) which takes values "+" for A, G of purine domain 
and "-" for C, T of pyrimidine domain (AG-CT). The 
differential distribution of Poincare recurrences pi(t) is 
given by a relative number of segments of fixed sign of 
length t while the integrated distribution P(t) gives the 
relative number of recurrences with times larger than t. 
The probabilities of domains AG and CT are close to 
0.5 for HS and mammalian sequences. Thus the recur- 
rences for both domains are very close to each other so 
that we show one average distribution P(t) for AG-CT 
corresponding to recurrences or crossings of line u = 0. 
A similar situation takes place for AC and GT domains 
so that we show for them one average distribution P(t) 
for AC-GT. For domains AT and CG the probabilities 
are approximately 0.6 and 0.4 and here we show sepa- 
rately recurrence probability P(t) for AT and CG do- 
mains. For Poincare recurrences P(t) of HS sequences 
these four cases are shown in Fig. Q] (left panel). In aver- 
age we find an algebraic decay P(t) ~ l/t® with /3 » 4. A 
formal fit for AG-CT data at t > 10 gives /3 = 3.68 ±0.02 
but there are visible large scale oscillations with a certain 
similarity to those seeing in dynamical maps [H, 0, H} . The 
dependence P(t) = 2 _< for a random sequence describes 
AG-CT and AC-GT data only on short times t < 5 while 
for larger times algebraic behavior becomes dominant. 
We note that P{t) is a positively defined quantity and 
thus it is statistically very stable: the sequences of size L 
well reproduce the initial part of P(t) almost up to values 
~ 1/L as it is shown in Fig. [TJ (bottom left panel), where 
L varies in a large interval of 10 5 < L < 1.5 • 10 10 bp. 

The comparison of statistics of Poincare recurrences 
for HS, mammalian and two other species are shown in 
Fig. [TJ for AG-CT case (similar average behavior is found 
for AC-GT data). The total sequence lengths L for other 
species are by a factor 3 shorter compared to HS case. Up 
to t ps 20 all considered species show the same decay of 
P(t) but at larger value of t there is a separation of curves 
so that each species is characterized by its own statistics 
P(t). In average all species show an algebraic decay with 
f3 w 4 even if there is a strong oscillation with a flat region 



10"" 
10" 6 



10"- 



10"" 
10" 6 





AG-CT 
AC-GT 
\ AT 

\ CG 

\ 






\ 


—4 

V* ■ 


2-''', 




s 

\ \ 
l\_ \ 


1 10 , 


100 


1000 



L= 


1.5-10 1 ? 






L 


=10" 






L 


=10° 






L 


= 10 6 






L 


=10^ 






L 


=10 




■ HS AG-CT 












10 



100 1000 



10 100 , 1000 10000 



FIG. 1: Statistics of Poincare recurrences P(t) for DNA se- 
quences. Top left panel: DNA data of Homo sapiens (HS) for 
Poincare recurrences of domains AG-CT, AC-GT, AT and 
CG (see text) . The lower dashed curve shows the exponential 
behavior P(t) = 2 valid for random sequences, the upper 
dashed line shows the average power law Pit) ~ t~ 4 for com- 
parison. Top right panel: AG-CT data for DNA sequences 
of the species: Felis catus (FC, Cat), Homo sapiens (HS, 
Human), Gorilla gorilla (GG, Gorilla), Canis familiaris (CF, 
Dog), Loxodonta africana (LA, Elephant), Xenopus tropicalis 
(XT, African Clawed Frogs) and Danio rerio (DR, Zebrafish). 
Bottom left panel: Convergence of the statistics of AG-CT 
Poincare recurrences P(t) for Homo sapiens as the length L 
of the considered DNA sequence increases from L = 10 5 to 
L = 1.5 • 10 10 . Bottom right panel: AC-GT data sets for the 
same species as in the top right panel. 



of P{t) for GG sequence (AC-GT data from Fig. Q] show 
a very similar behavior in this case). It is interesting 
to note that the curves of Poincare recurrences are very 
close for HS and GG sequences up to t w 200 and for HS 
and FC sequences up to maximal t ss 10 3 . However, for 
AC-GT data set the curves for these sequences become 
different for t > 20 (Fig. [TJ. 

It is important to understand how the statistics of 
Poincare recurrences is related to the anomalous super- 
diffusive walk discussed in 

EMI- The walk is de- 
scribed by a displacement variable y(t) = J2r=i U ( T ) 
whose growth can be characterized by a diffusion coef- 
ficient defined as D(t) = a/t with the second moment 
a = (Ay(t) 2 ), Ay{t) = y(t+t )-y{t ) - (y(t+t )-y(t )) 
and the average (• • • ) is done with respect to the initial 
position (or "time") t . In case of a standard diffusive 
process the diffusion coefficient D converges to a finite 
value at large times. However, the results of [16| give an 
algebraic super-diffusive growth D(t) ~ i M with the ex- 
ponent fi ss 0.34 for HS sequence of length L ~ 10 5 and 
t < 10 3 . Our results are obtained on a significantly larger 
scale of t being by 4 orders of magnitude larger compared 
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FIG. 2: Top left panel: Diffusion coefficient D(t) = {Ay 2 (t))/t 
for AG-CT data sets of DNA sequences of HS and CF. The 
lower green curve (non-cor) is the diffusion coefficient ob- 
tained for a model with individual recurrences being dis- 
tributed as the Poincare recurrences of HS in Fig. [Tj but as- 
suming that subsequent Poincare recurrences are not corre- 
lated. The black crosses (sim-cor) represent the diffusion co- 
efficient obtained from (|3| using the Poincare recurrence cor- 
relation function Cp(n) for HS (see text and Fig. [4] below). 
The dashed line shows a power law D ~ t 0A . Top right panel: 
Diffusion coefficient D(t) for AG-CT data sets of the same 
species as in the right panel of Fig. [TJ Bottom panels: Diffu- 
sion coefficient D(t) for AC-GT data sets for the same cases 
as in top panels; the dashed line in the left panel represents 
a power law dependence D(t) ~ t 6 . 

to those reached in (l6Wlj|. Our results for diffusion D(t) 
are shown in Fig.[2j For HS sequence we have large statis- 
tics and large exact segments without non-determined bp 
marked as N in database [l^]. We find /x « 0.4 for the 
range 10 < t < 10 6 (fit gives /x = 0.349 ± O.OOlHn 
a satisfactory agreement with previous studies [16H18I ]. 
Other species also show an algebraic growth of D(t) with 
similar values of /i (Fig. [5]). For AC-GT data we also 
find a similar behavior with \i » 0.6 for HS sequence 
(Fig. [2]). However, for HS sequence with most exact and 
long data set we find a saturation of D(t) for large times 
10 6 < t < 10 7 . 

The diffusion coefficient is related to the correla- 
tion function c(t) = {u{t + t )u(t )) as D{t) — 
(1/t) Ya=i 2,-=-i+i c 0) an d hence a divergence of D im- 
plies a slow correlation decay c(t) ~ t^ 1 if c(t) is mono- 
tonic. On the other hand this correlation function can 
also be expressed as 

oo oo 

C(*) =J2(- 1 Y l ~ 1 J2 {tl+ ■■■+t n -t) Pn (t 1 , ...,t n ) 
n=l ti+. ..+«„>* 

(1) 

where p n (ti, . . . , t n ) is the joint distribution of n subse- 
quent Poincare recurrence times t±, . . . , t n . In this sum 



each term represents the case where n subsequent recur- 
rences are needed to cover the interval 0, 1, . . . , t and the 
prefactor t\ + . . .+t n —t accounts for the number of differ- 
ent initial positions of the first recurrence to allow this. 
If we assume that subsequent Poincare recurrences are 
not correlated, i. e.: p n (h, t n ) = pi(ti) • . . . • Pi(t n ), 
and that P{t\) obeys the power law P(t{) ~ t^[ , i. e.: 
Pl(ti) = P(h) - P(h + 1) - t^~ l we find that in the 
above expression the first term for n = 1 dominates the 
limit t — ¥ oo and we find that c(t) « St°=t+i -^(^i) ~ 
tP{t) ~ t 1 ^ 13 . We mention that this result was previously 
also obtained for chaotic Hamiltonian dynamics 0, [tJ- 
Therefore we should have a good convergence of D with 
(3 w 4. However, this relation is obtained for the case 
of uncorrelated Poincare recurrences that may not be the 
case for DNA sequences. Indeed, if we generate uncor- 
related recurrences with the distribution Pit) being the 
same as in Fig. Q] for AG-CT sequence of HS and compute 
with them the diffusion coefficient then we find a clear 
saturation of D(t) at a finite value D = 1.77 (green curve 
in Fig. [2 left panel) , being significantly smaller then the 
actual data of D(t) ~ 100. 




FIG. 3: Left panel: Density plot of the normalized two point 
correlator p~2(ti,t2) of two subsequent Poincare recurrences 
ti and t2 for AG-CT data sets of HS. The shown range 
1 < ti,ta < 12 represents 99.4% of probability. Red (green, 
blue) color represents maximal (zero, minimal) values, hori- 
zontal/vertical axes show t\ and fa. Right panel: Normalized 
two point correlator f>2(ti, £3) of ti and ts for three subsequent 
Poincare recurrences ti, fa, £3 with ti and t^ on the axes. 

To visualize the correlations between Poincare recur- 
rences we also compute the joint probability pi(ti,t 2 ) of 
two subsequent Poincare recurrences t\ and ti for HS 
sequence of Fig. [TJ The normalized two point corre- 
lator is pi{t\M) = P2{ti 1 t 2 )/[pi{ti)pi(t 2 )} - 1, where 
Pi(ti) = P{t\) — P{t\ + 1) is the probability of one indi- 
vidual recurrence of length t\. Its dependence on t\, t% is 
shown in Fig. [3] The correlator is maximal for t\ = 1 
(i.e. below the average recurrence time (ti) — 2.27) 
and ti > 8 (i.e. above average) or vice-versa thus in- 
dicating anti-correlations between t\ and t 2 . In the right 
panel of Fig. [3] we show the normalized two point corre- 
lator pi(ti, £3) for t\ and £3 taken from three subsequent 
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Poincare recurrence times t±, t 2 , t 3 . In this case t\ and £3 
are correlated, i.e. if t\ is above average it is more likely 
that t 3 is also above average. 

Thus, in a sequence of Poincare recurrences 
ti, t2, t 3 , ... the odd elements represent steps of length 
ti, t^, ... of one sign of u(t) and the even elements rep- 
resent steps of length 1%, £4, ■ • ■ of the other sign. The 
anti-correlations between t\ and t% or ti and £3 as well 
as the correlations between t\ and t 3 indicate that once 
a preferential direction is chosen it is more likely for it to 
be enhanced thus explaining the diffusion enhancement 
compared to the uncorrelated Poincare recurrences which 
give a hnite coefficient D w 1.7. To work out this point on 
a more quantitative level we consider the displacement af- 
ter n Poincare recurrences at time t — t\+. . -+t n ~ n{t\). 
We can write for it 
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where (— l) s is the sign of the first segment associated to 
t\. For n 3> 1 this leads to 

D(n(t 1 )) = - ? - T ^((7p(0) + 2^(-iyCp(i)) , (3) 
' { 11 1=1 j=i 

where Cp(j) — (ti ti+j}— (ti) 2 is the Poincare recurrence 
correlation function and the average is done over all re- 
currences p^j . We note that the above model of uncor- 
related Poincare recurrences corresponds to Cp(j) = 
for j > 0. In this case Eq.© gives D = C P (0)/(ti) = 
4.01/2.27 = 1.77 in a perfect agreement with the data of 

Fig. m 

The Poincare recurrence correlation function Cp(n) is 
computed from DNA sequence data and its dependence 
on recurrence index/number n w t/(ti) is shown in Fig. [4] 
for AG-CT data sets of HS and CF. For HS data this 
correlation function has alternate signs for odd and even 
n up to n rj 3 • 10 3 . For larger n values these terms 
have the same sign and moreover these terms become 
approximately equal for n > 10 5 . This leads to cancella- 
tion of odd and even terms in ([3]) and saturation of the 
growth of diffusion coefficient at t > 10 6 as it is clearly 
seen in Fig. [2j Such a saturation of D(t) takes place 
in spite of a rather slow algebraic decay of correlation 
Cp(n) ~ n~ v with v w 0.6 (for even terms an error 
weighted fit gives v = 0.575 ± 0.003 at 10 < n < 3 • 10 6 
and for odd terms v = 0.479 ± 0.005 at 10 < n < 10 3 ). 
From the found correlation function Cp{n) we can de- 
termine the dependence D(t) using §5$ that gives a good 
agreement with the data obtained by a direct compu- 
tation of D(t) as it is shown in Fig. [2] (deviations at 
t < 10 are due to an approximate validity of the rela- 
tion t = ti + .. + t n fts n{t\) at small t). We note that the 
relation between exponents fi = 1 — v, corresponding to 
a simple estimate D ~ t \ Cp{t)\, remains valid in absence 



FIG. 4: Top panels: Poincare recurrence correlation function 
Cp(n) = {ti in+i) — (ti) 2 of t\ and t n +i in a sequence of sub- 
sequent Poincare recurrences tj for HS (left panel) and CF 
(right panel) sequences for AG-CT data sets. Blue crosses 
correspond to even n and red squares to odd n. The dashed 
line shows a power law Cp(n) ~ n -0,6 . For clarity positive 
and negative values of Cp(n) are shown on two separate log- 
arithmic scales which are put together at Cp = ±10 -4 shown 
by the green line. Bottom panels: same as in top panels but 
for AC-GT data sets for HS (left panel) and CF (right panel); 
the dashed line shows the dependence Cp(n) ~ n~ 0A . 
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FIG. 5: Relative statistical error ACp(n)/\Cp(n)\ of the 
Poincare recurrence correlation function for HS (left panel) 
and CF (right panel) sequences for AG-CT data sets shown 
in top panels of Fig. [4] Blue crosses correspond to even n and 
red squares to odd n. The green line indicates the value of 10 
%. 



of odd/even terms cancellation at t < 10 6 . For CF data 
set we find approximately the same algebraic decay with 
v w 0.6 (Fig. HI right panel). In this case the total num- 
ber of recurrences N r is statistically smaller compared 
to HS case and in addition undetermined letters ./V of bp 
are broadly scattered over the sequence. Due to that here 
we do not find large number N r (n) of recurrence times at 
large n that force us to stop at n < 2.5 • 10 5 where a satu- 
ration of D(t) growth is not visible (for HS case we have 
N r (n) w 5 • 10 9 recurrences at n = 10 6 but many of them 
are correlated and the statistical error of Cp(n) is about 
5% here while for smaller n it becomes smaller than the 
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symbol size in Fig. [4]). For AC-GT data sets, shown in 
Fig. |4l we find an algebraic decay with exponent v « 0.4 
corresponding to the value /x ss 0.6 from corresponding 
Fig.O The convergence of odd/even terms of Cp(n) for 
HS case takes place at n > 10 5 leading to saturation of 
diffusion rate at t > 10 6 also visible for AC-GT data 
(Fig. [2|. For CF data we have lower statistics for large 
n and t and saturation of D{t) remains invisible. 

The analysis of statistical accuracy of the computa- 
tion of correlation function Cp(n) is presented in Fig. [5] 
Here we show the variation of relative statistical error 
ACp(n)/\Cp(n)\ in the value of Cp{n) as a function of 
n. This error increases from a level of 10 -3 at n < 100 
up to 0.1 at n « 2 • 10 6 for HS and at n « 10 4 for CF (a 
strong increase of error at n ~ 3 ■ 10 3 for HS is related 
to a sign change of C(n)). The relative error increases 
with n since at large n we have smaller number of recur- 
rences N r contributing in the computation of C(n). For 
the HS case the number of non-determined N letters al- 
lows to have a significantly larger number of recurrences 
N r compared to the CF case and due to this we obtain 
statistically good values of C(n) at significantly larger 
values of n. 

Let us give now the formal fit parameter values for 
the dependencies discussed above. The fit of Poincare 
recurrences for the data of Fig. [T] at t > 10 gives the 
Poincare exponent /3 = 3.68 ± 0.02 (AG-CT), 3.65 ± 0.04 
(AC-GT), 3.75 ±0.03 (AT), 4.04 ±0.05 (CG). The fit of 
D(t) ~ f for AG-CT data of HS in Fig. Q] gives (j, = 
0.3486 ± 0.0008 for the range 10 < t < 10 6 but there are 
two intervals with distinct values n = 0.5010 ± 0.0003 for 
10 < t < 3 • 10 3 and (i = 0.2859 ± 0.0003 for 3 • 10 3 < 
t < 10 6 so that we give in the text the average /i « 0.4. 
For AC-GT data of HS the whole range of D(t) is well 
characterized by a fit exponent /i = 0.5553 ± 0.0004 for 
100 < t < 10 6 (see Fig. H}. Furthermore for AC-GT 
data the correlation function behaves also as Cp(n) ~ 
nr v where for HS the exponent obtained from an error 
weighted fit is v = 0.367 ± 0.004 for even terms and 
v = 0.320 ± 0.004 for odd terms, both at 10 < n < 10 4 . 
Even if formal statistical errors are quite small we should 
note that there are rather pronounced oscillations and 
due to that reason we give in the above discussions only 
approximate values of the exponents. 

The presented results determine the statistics of 
Poincare recurrences of DNA sequences and link their 
properties to the statistics of sequence walks studied pre- 
viously [16 18]. The anomalous diffusion of walks is re- 
lated to enormously long correlations between far away 
recurrences. For most detailed HS sequences the diffusion 
coefficient of these walks becomes finite due to cancel- 
lations of slow decaying correlations. For other species 
larger statistical samples are required to see if the dif- 
fusion coefficient saturation is present. The Poincare re- 
currences Pit) are statistically very stable and show clear 
difference between various species. The statistical anal- 



ysis of human and mammalian DNA sequences is now 
an active research field with links to genome evolution 
(see e.g. 2(3- 22 j) and the approach based on Poincare 
recurrences should bring here new useful insights. 

The obtained properties of Poincare recurrences can be 
used for verification of various theories of genome evolu- 
tion (see e.g. jUH^). Such theories should reproduce 
well the main statistical features of Poincare recurrences 
described here. Indeed, the data of Fig. [T]show that for 
t < 5 the recurrences for all analyzed species behave like a 
random sequence of coin flipping. Thus the genome evo- 
lution generates random uncorrelated short range recur- 
rences. However, for the range 5 < t < 20 we have a be- 
ginning of algebraic decay of P(t) but still all the species 
follow practically the same curve. This indicates an ex- 
istence of a common period of initial evolution history. 
For t > 20 we observe a strong divergence of Poincare 
curves of different species. Surprisingly the curves of HS 
and FC (as well as LA and XT) remain very close to each 
other up to largest recurrences with t « 400 for AG-CT 
data sets. At the same time for AC-GT data sets a close 
proximity of recurrences is observed for HS, LA and XT 
(as well as for FC and CF) up to largest values t « 300. 
This show various aspects of proximity between species 
which should be investigated in further studies. We hope 
that the new tool of Poincare recurrences will allow to 
analyze the proximity between species under a new angle 
lightening new sides of life evolution. 
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